Learning reward timing in cortex through reward dependent expression of synaptic plasticity.
Abstract
The ability to represent time is an essential component of cognition, but its neural basis is unknown. Although temporal processing has been studied extensively, both behaviorally and electrophysiologically, a general theoretical framework describing the elementary neural mechanisms used by the brain to learn temporal representations is lacking. It is commonly believed that the underlying cellular mechanisms reside in higher-order cortical regions, but recent studies show sustained neural activity in primary sensory cortices that can represent the timing of expected reward. Here, we show that local cortical networks can learn temporal representations through a simple framework predicated on reward-dependent expression of synaptic plasticity. We assert that temporal representations are stored in the lateral synaptic connections between neurons and demonstrate that reward-modulated plasticity is sufficient to learn these representations. We implement our model numerically to explain reward-time learning in the primary visual cortex (V1), demonstrate experimental support, and suggest additional experimentally verifiable predictions.
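The idea that a lateral connection can store an interval, and that a reward-gated update can tune it, can be illustrated with a deliberately small sketch. This is not the model from the paper: it assumes a single linear rate unit whose activity decays as r(t) = exp(-(1 - w) t / tau), so the lateral weight w sets how long activity persists, and it assumes an update rule in which the reward-driven change shrinks as activity at reward time approaches a threshold. The function names, the parameters (tau, theta, eta) and the update rule itself are all hypothetical choices made for illustration.

```python
import math


def persistence_time(w, tau=0.1, theta=0.5):
    """Time (s) for r(t) = exp(-(1 - w) * t / tau) to decay to theta."""
    return -tau * math.log(theta) / (1.0 - w)


def train_lateral_weight(reward_time, w=0.0, tau=0.1, theta=0.5,
                         eta=0.05, trials=500):
    """Toy reward-gated update of one lateral weight (illustrative only)."""
    for _ in range(trials):
        # Activity still present when the reward arrives on this trial.
        r_at_reward = math.exp(-(1.0 - w) * reward_time / tau)
        # Assumed rule: strengthen w while activity ends before the reward,
        # weaken it if activity outlasts the reward; no change at r == theta.
        w += eta * (theta - r_at_reward)
        # Keep the unit stable: w < 1 means activity always decays.
        w = min(w, 0.999)
    return w


if __name__ == "__main__":
    w = train_lateral_weight(reward_time=1.0)
    print(f"learned w = {w:.4f}, persistence = {persistence_time(w):.3f} s")
```

Under these assumptions the weight settles where the activity reaches the threshold exactly at the reward time, so the persistence of the response reports the learned interval. The learning rate matters: with a much larger eta the same update oscillates around the target instead of converging.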
Related articles
The emotive brain, the noradrenergic system, and cognition
Motivation and attention can have a profound influence on perception, learning and memory. Neuromodulatory systems, especially the noradrenergic (NE) system, co-vary with psychological states to modulate cortical arousal, influence sensory processing and promote synaptic plasticity. There is even some suggestion that the NE system might facilitate functional recovery after brain damage. Post-sy...
A Learning Theory for Reward-Modulated Spike-Timing-Dependent Plasticity with Application to Biofeedback
Reward-modulated spike-timing-dependent plasticity (STDP) has recently emerged as a candidate for a learning rule that could explain how behaviorally relevant adaptive changes in complex networks of spiking neurons could be achieved in a self-organizing manner through local synaptic plasticity. However, the capabilities and limitations of this learning rule could so far only be tested through c...
Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity
The persistent modification of synaptic efficacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spike-timing-dependent plasticity (STDP). Here we show that the modulation of STDP by a global reward signal leads to reinforcement learning. We first derive analytically learning rules involving reward-modulated spike-timing-dependent synaptic and intri...
Towards a learning-theoretic analysis of spike-timing dependent plasticity
This paper suggests a learning-theoretic perspective on how synaptic plasticity benefits global brain functioning. We introduce a model, the selectron, that (i) arises as the fast time constant limit of leaky integrate-and-fire neurons equipped with spiking timing dependent plasticity (STDP) and (ii) is amenable to theoretical analysis. We show that the selectron encodes reward estimates into s...
Journal:
- Proceedings of the National Academy of Sciences of the United States of America
Volume 106, Issue 16
Pages: -
Publication date: 2009